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BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to code-excited linear prediction (CELP) 
speech processing. Specifically, the present invention relates to translating 
digital speech packets from one CELP format to another CELP format. 

Related Art 

Transmission of voice by digital techniques has become widespread, 
particularly in long distance and digital radio telephone applications. This, in 
turn, has created interest in determining the least amount of information which 
can be sent over the channel while maintaining the perceived quality of the 
reconstructed speech. If speech is transmitted by simply sampling and 
digitizing, a data rate on the order of 64 kilobits per second (kbps) is required to 
achieve a speech quality of a conventional analog telephone. However, through 
the use of speech analysis, followed by the appropriate coding, transmission, 
and resynthesis at the receiver, a significant reduction in the data rate can be 
achieved. 

Devices which employ techniques to compress voiced speech by 
extracting parameters that relate to a model of human speech generation are 
typically called vocoders. Such devices are composed of an encoder, which 
analyzes the incoming speech to extract the relevant parameters, and a decoder, 
which resynthesizes the speech using the parameters which it receives over a 
channel, such as a transmission channel. The speech is divided into blocks of 
time, or analysis subframes, during which the parameters are calculated. The 
parameters are then updated for each new subframe. 

Linear-prediction-based time domain coders are by far the most popular 
type of speech coder in use today. These techniques extract the correlation from 
the input speech samples over a number of past samples and encode only the 
uncorrelated part of the signal. The basic linear predictive filter used in this 
technique predicts the current sample as a linear combination of the past 
samples. An example of a coding algorithm of this particular class is described 
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the vibration of the vocal cords hTf^l p "'I '""f tenn redundancies «° 
by two filters a Z^T « ^ "P***™ are modeled 

theTl^' a . Sh0rt ■ ,em, fonnant filt - — » '°ng-term pitch filter. Once 

as wmte gaussian noise, which is also encoded 

filters ^effi " H TfT" * * ^P"* ** ° f ^° 

n^L ^* fonnant ^ <ak ° *"<"» 38 *• "LPC (linear 

predion coefficients) filter^, performs short . tem) < 

* ,?» ^ *■ pitch filter, performs tn^ 

^ ^ de j eminin 8 -hi* °ne of a number of random excitation 

~T ™ u °° k ^ fa ** dOSeSt applanation to me original 
M «h» the waveform excites the two filters mentioned above. ThusTe 

a^T P 7TT relafet0toee itemS (1 > *« ^ Wter, (2) me pitch filter 
z o and (3) the codebook excitation. 

Digital speech coding can be broken in two parts; encoding and 
decoding, sometimes known as analysis and synthesis. FIG. 1 is a block 
diagram of a system 100 for digitally encoding, transmitting and decoding 

25 S , ™! SyStCm ildUdeS 3 C ° der 102 ' 3 dumnd 104 ' a decoder 106. 

Channel 104 can be a communications channel, storage medium, or theiike. 
Coder 102 receives digitized input speech, extracts the parameters describing 
the features of the speech, and quantizes these parameters into a source bit 
stream that is sent to channel 104. Decoder 106 receives the bit stream from 
channel 104 and reconstructs the output speech waveform using the quantized 

30 features in the received bit stream. 

Many different formats of CELP coding are in use today. In order to 
successfully decode a CELP-coded speech signal, the decoder 106 must employ 
the same CELP coding model (also referred to as "format") as the encoder 102 
that produced the signal. When communications systems employing different 
CELP formats must share speech data, it is often desirable to convert the speech 
signal from one CELP coding format to another. 

One conventional approach to this conversion is known as "tandem 
coding." FIG. 2 is a block diagram of a tandem coding system 200 for 
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converting from an input CELP formatto an output CELP format Th 
includes an input CELP for™.* j j . >ormat. The system 

encoder 202. Input C^ d h ,nf "* *" ° U,PUt ^ for ™' 

.o hereinafter a! theTpu^T^t n^sT^ 3 T* ^ < " t ™ d 
forma, (referred to herein^ at^e .Wu^ T °*« °~ ^ 
input sienal to m-nrf P format). Decoder 206 decodes the 

fonna, (referred to i^T^I^r 11 ^ ^ ^ 
signa: in *. outpu, forma,, rne prtaXd vtT 8 e I 

SUMMARY OF THE INVENTION 

CELP-based vocoder packet translation. The apparatus includes a format 
= ertransW 

fo^alf ^ ^ T - ^ fonnat t0 ^ ° UtpUt CELP fomat to P»*« -*ut 
^ttt b r f ^ ^ eXdtati ° n P— - translator that translates 
S^eTp f C ° COrreS P°^ » *■ speech packet from 

flwinputCELP format to the output CELP format to produce output pitch and 
codebook parameters. The formant parameter translator includes a model 

iL co rr r y converts modei ° rder ° f ** *™ 

coeffioents from the model order of the input CELP format to the model order 
o *e output CELP format and a time base converter that converts the timetse 
of the mput formant filter coefficients from the time base of the input CELP 
format to the rime base of the output CELP format. 

The method includes the steps of translating the formant filter 
coefficients of the input packet from the input CELP format to the output CELP 
format and translating the pitch and codebook parameters of the input speech 
packet from the input CELP format to the output CELP format. The step of 
translating the formant filter coefficients includes the steps of translating the 
formam filter coefficients from input CELP format to a reflection coefficient 
CELP format, converting the model order of the reflection coefficients from the 
model order of the input CELP format to the model order of the output CELP 
format, translating the resulting coefficients to a line spectral pair (LSP) CELP 
format, converting the time base of the resulting coefficients from the input 
CELP format time base to the output CELP format time base, and translate the 
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resulting coefficients from LSP format to the output CELP for™* ♦ 

output fbrmant filter coefficients Th„ «. r ° P roduce 

codebook n aram ♦ • ste P of translating the pitch and 

Ph* Id IhT r St6PS ° f ^-"ng speech using the input 

^t^T rr , * B t0 Pr ° dUCe 3 targCt *** ** Sear ^S 

Z^^T parameters — - *** ^ - 

de CTa d^n adVantaSe ° f 18 * at * eliminates the 

Nation m PerCSPtUal » mduced ^ t-dem coding 

10 

BRIEF DESCRIPTION OF THE FIGURES 

The features, objects, and advantages of the present invention will 
become more apparent from the detailed description set forth below when 
taken m conjunction with the drawings in which like reference characters 
identify correspondingly throughout and wherein: 

and dec^sp^f ^ * * ^ *" 

20 an innu^T ^ * SyStem for Verting from 

20 an input CELP format to an output CELP format; 

FIG. 3 is a block diagram of a CELP decoder; 
FIG. 4 is a block diagram of a CELP coder; 
^ FIG. 5 is a flowchart depicting a method for CELP-based to CELP-based 
vocoder packet translation according to an embodiment of the present 
25 invention; 

FIG. 6 depicts a CELP-based to CELP-based vocoder packet translator 
according to an embodiment of the present invention; 

FIGS. 7, 8, and 9 are flowcharts depicting the operation of a formant 
parameter translator according to an embodiment of the present invention; 
30 Ha 10 18 a flowchart depicting the operation of an excitation parameter 

translator according to an embodiment of the present invention; 

FIG. 11 is a flowchart depicting the operation of a searcher; and 

FIG. 12 depicts an excitation parameter translator in greater detail. 

35 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The preferred embodiment of the invention is discussed in detail below. 
While specific steps, configurations and arrangements are discussed, it should 



BNSDOCID: <WO 0048170A1_L> 



WO 00/48170 



PCT/USOO/03855 



10 



15 



20 



25 



30 



35 



L i^^jr^ or mustrative purposes oruy - a ~ ^ 

arraneemenT t , *" ^ Ste P S ' -Rations and 

™ r ^ ^ With ° Ut fr ° m * e -Pi* scope of the 

mfolano 7 ^ PreS6nt ^ USe * * vLety of 

t^ COmmUniCati0n S ~ -tellite and terJtrial 

ceUular telephone systems. A preferred application is in CDMA wireless 
spread spectrum communication systems for telephone service 

The present invention is described in two parts. First, a CELP codec 
mcludmg a CELP coder and a CELP decoder, is described. Then, a pack* 
translator is described according to a preferred embodiment 

Before describing a preferred embodiment, an implementation of the 

CE^oTer r^r ° fHG1 *** - L implementa^n 

2^1 A , ^ ^ "^"W^ -ethod to encode a speech 
signal. According to this method, some of the speech parameters are computed 

Z ? T P maimer ' ^ ^ arC dete ™ ed - * closed-loop mode by 
trial and error. Specifically, the LPC coefficients are determined by solving a set 
of equations. The LPC coefficients are then applied to the formant filter Then 
hypothetical values of the remaining parameters (codebook index, codebook 
gain, pitch lag, and pitch gain) are used with the formant filter to synthesize a 
speech signal. The synthesized speech signal is then compared to the actual 
speech signal to determine which of the hypothetical values of the remaining 
parameters synthesizes the most accurate speech signal. 

A Code Excited Linear Predictive (CELP) Decoder 

The speech decoding procedure involves unpacking the data packets, 
unquantizing the received parameters, and reconstructing the speech signal 
from these parameters. The reconstruction consists of filtering the generated 
codebook vector using the speech parameters. 

FIG. 3 is a block diagram of a CELP decoder 106. CELP decoder 106 
includes a codebook 302, a codebook gain element 304, a pitch filter 306, a 
formant filter 308, and a postfilter 310. The general purpose of each block is 
summarized below. 

Formant filter 308, also referred to as an LPC synthesis filter, can be 
thought of as modeling the tongue, teeth and Hps of the vocal tract, and has 
resonant frequencies near the resonant frequencies of the original speech caused 
by the vocal tract filtering. Formant filter 308 is a digital filter of the form 
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l/A(z) = l-a j2 1 -...-a n 2* (1) 

'S^Z-JS^ » « ~ - a, Want fUter 
^g^I^^ * «~~ P- *ain 

of - W the 1^. Examples of voiced sounds are the O in "low" and the A 
m day. Dunng unvoiced speech, the pitch filter basically passes the b*£t 
^eoutput unchanged. Unvoiced speech is produced by L^uZgh I 

r^rT^^^^ ^-ofunvoLsoundst 

^1?*^^? a c ;r iction between tongue ^ u ^ er 

a tne ^ m shuffle/ fomed by & constriction betw 
upperteeth. Pitch filter 306 is a digital filter of the form P 
l/P(z) = l/(l. b O = l + bzSb'z'S... 

where b is referred to the pitch gain of the filter and L is the pitch lag of the 

. C ° deh °°* 3 °f be *«<6* of as modeling the turbulent noise in 
unvoiced speech and the excitation to the vocal cords in voiced speech. During 
background noise and silence, the codebook output is replaced by random 

vXs C 1 2 St ° reS a nUmb6r ° f data ™ rds to as codebook 

vectors^ Codebook vectors are selected according to a codebook index I. The 
selected codebook vector is scaled by gain element 304 according to a codebook 
gam parameter G. Codebook 302 may include gain element 304. The output of 
the codebook is then also referred to as a codebook vector. Gain element 304 
can be implemented, for example, as a multiplier. 

Postfilter 310 is used to "shape" the quantization noise added by the 
parameter quantization and imperfections in the codebook. This noise can be 
noticeable in frequency bands which have little signal energy, yet might be 
imperceptible in frequency bands which have large signal energy. To take 
advantage of this property, postfilter 310 attempts to put more quantization 
noise mto perceptually insignificant frequency ranges, and less noise into 
perceptually significant frequency ranges. This postfiltering is discussed 
further in J-H. Chen & A. Gersho, "Real-Time Vector APC Speech Coding at 
4800 bps with Adaptive Postfiltering," in Proc. ICASSP (1987) and N.S. Jayant & 
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• . racn «">»aine, a set of speech parameters is applied to CFT P 
decoder 106 to generate one subframe of synthesized speech -(nf ^e s^ 
Parameters include codebook index I, codebook gain Glitch lag 

b, and formant filter coefficient a, ... a„. One vector of codebook 302 ^eSS 
««Ng to index I, scaled according to gain G, and used to excite ^ m*r 
306 and forman, futer 308. Pitch filter 306 operates on the selected codebTk 
vector according to pitch gain b and pitch lag L. Formant filter 308 opera e^n 
*e s,gnal generated by pitch filter 306 according to formant filter coefficient 

• a„ to produce synthesized speech signal • (n). 

A Code Excited Linear Predictive (CELP) Coder 

The CELP speech encoding procedure involves determining the input 
parameters for the decoder which minimize the perceptual different between a 
synthesized speech signal and the input digitized speech signal. The selection 
processes for each set of parameters are described in the following subsections. 
The encodmg procedure also includes quantizing the parameters and packing 
them mto data packets for transmission, as would be apparent to one skilled in 
the relevant arts. 

FIG. 4 is a block diagram of a CELP coder 102. CELP coder 102 includes 
a codebook 302, a codebook gain element 304, a pitch filter 306, a formant filter 
308, a perceptual weighting filter 410, an LPC generator 412, a summer 414, and 
a inmimization element 416. CELP coder 102 receives a digital speech signal 
s(n) that is partitioned into a number of frames and subframes. For each 
subframe, CELP coder 102 generates a set of parameters that describe the 
speech signal in that subframe. These parameters are quantized and 
transmitted to a CELP decoder 106. CELP decoder 106 uses these parameters to 
synthesize the speech signal, as described above. 

Referring to FIG. 4, the generation of LPC coefficients is performed in an 
open-loop mode. From each subframe of input speech samples s(n) LPC 
generator 412 computes LPC coefficients by methods well-known in the 
relevant art. These LPC coefficients are fed to formant filter 308. 

The computation of the pitch parameters b and L and codebook 
parameters I and G however, is performed in a closed-loop mode, often referred 
to as an analysis-by-synthesis method. According to this method, various 
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combinations of suess codebook m d p.** d -^m' ' fferent 

5^rToT erated by « — ~ - - ~ - 

10 In the embodiment depicted in FIG. 4, the input speech samples sfnl 

wetted by perceptual weighting filter 410 soltl w^ t ell 
samples are provided to sum input of adder 414. Perceptual wethlet 
ut^ed to weight the error at the frequencies where there is'less s£^J 

noH STn P ° Wer ** ** ™* is ™« P-eptuaUy 

Pat^N ^ 7Q P r ePtUal WCiShtinS ^ ^ d ™ d * ^States 
Patent No 5,414,796 entitled "Variable Rate Vocoder," which is incorporated by 
reference herein in its entirety. 7 

Minimization element 416 conducts the search for the codebook and 
pit* parameters in two stages. First, minimization element 416 searches for the 
patch parameters. During the pitch search there is no contribution from the 
codebook (G = 0). In minimization element 416 all possible values for the pitch 
lag parameter L and the pitch gain parameter b are input to pitch filter 306 
Mmunization element 416 chooses the values of L and b that minimize the error 
r(n) between the weighted input speech and the synthesized speech. 

25 Once the pitch lag L and the pitch gain b for the pitch filter are found the 

codebook search is performed in a similar manner. Minimization element 416 
then generates values for codebook index I and codebook gain G. The output 
values from codebook 302, selected according to the codebook index I are 
multiplied in gain element 304 by the codebook gain G to produce the sequence 

30 of values used in pitch filter 306. Minimization element 416 chooses the 

codebook index I and the codebook gain G that minimize the error r(n). 

In one embodiment, perceptual weighting is applied to both the input 
speech by perceptual weighting filter 410 and the synthesized speech by a 
weighting function incorporated within formant^fHter 308. In an alternative 

15 embodiment, perceptual weighting filter 410 may be placed after adder 414. 
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CELP-based to CELP-based Vocoder Packet Translation 

Ltfce^se, the result of the translation is referred to as the "output" packet 
havmg an output" CELP fonna, that specifies "output" codebook and Let 

suTT T ° UtPU '" f0nnimt fflter CM£fiden *- °~ «*> 

x^^T „ '° T*" " ,ekph ° ne ~ to Fernet" ^ 

exchanging speech signals. 

FIG. 5 is a flowchart depicting the method according to a preferred 
embodim«it. The translation proceeds in three stages. In the first stage, the 

tzr^ de ? of *• * put speech packet « — 

input CELP format to the output CELP format, as shown in step 502. In the 
second stage, the pitch and codebook parameters of the input speech packet are 
translated from the input CELP format to the output CELP format, as shown in 
step 504. In the third stage, the output parameters are quantized with the 
output CELP quantizer. 

20 HG - 6 de P icts a P ack et translator 600 according to a preferred 

embodiment. Packet translator 600 includes a formant parameter translator 620 
and an excitation parameter translator 630. Formant parameter translator 620 
translates the input formant filter coefficients to the output CELP format to 
produce output formant filter coefficients. Formant parameter translator 620 
includes a model order converter 602, a time base converter 604, and formant 
filter coefficient translators 610A,B,C. Excitation parameter translator 630 
translates the input pitch and codebook parameters to the output CELP format 
to produce output pitch and codebook parameters. Excitation parameter 
translator 630 includes a speech synthesizer 606 and a searcher 608. FIGS 7 8 

30 and 9 are flowcharts depicting the operation of formant parameter translator 

620 according to a preferred embodiment. 

Input speech packets are received by translator 610A. Translator 610A 
translates the formant filter coefficients of each input speech packet from the 
mput CELP format to a CELP format suitable for model order conversion. The 

35 model order of a CELP format describes the number of formant filter 

coefficients employed by the format. In a preferred embodiment, the input 
formant filter coefficients are translated to reflection coefficient format, as 
shown in step 702. The model order of the reflection coefficient format is 
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tZ. uJ27r Wo m ° del Mder ° f *— * -fficien, 

relevant ^ ^^'^^ ««— *« - -eU-teown „ me 

diluents, tfiis translation is unnecessary. 

transW^Z™.^ *« — *» from 

the model order of ^ '"'7^7 refleCtion «*<rMenrs from 

CELP form^ ^ ^ ,° ? ^ * ° U ^' 

forma, is lo wer man the model order C^pT ^ ^ 

of the ™*,„t rriD. P format is higher than the model order 

of the output CELP forma,, dedmator 614 performs a decimation operation to 
«duce the number o, coefficients, as shown in step 804. fa o^el^nC 
^unnecessary coefficients are simp,y replaced by zeroes. Such inte^oZn 
and I ****** operations are weU-known in the relevant arts, fa £ 
coefficen, reflection domain model, order conversion is relatively stapT 
™*ng * a Kkely choice. Of course, i, the model orders o, the mput anVoZu, 
CELP formats are the same, model order conversion is unnecessary 

from mo^H rKeiVeS ^ ° rd «— «ed formar,, fitter coefficients 
^model order converter 602 and translates the coefficients from the 
reflect™ coefficent format „ a CELP format suitable for time base conversion. 
•O. ^ time base of a CELP forma, describes the rate a, which me forman, 
£*T Pa " meters m -W U. the number of vectors per second of 
fennant synthes* parameters. In a preferred embodiment, the reflection 

^tr, l ° "« s P« tral (LSP) format, as shown in step 

706. Methods for performing such a translation are well-known in the relevant 

dXt* 

ftiim ^ taSe C ° nVerter 604 rCCeiVeS ^ ^cients from translator 
610B and converts the time base of the LSP coefficients from the time base of the 

7 U tn?^ ^ t0 *** ^ ° f ° Ut P Ut foimat ' as shown in 

II? ^ C ° nVerter 604 indudes 311 ^'Polator 622 and a decimator 

624 When the time base of the input CELP format is lower than the time base 
of the output CELP format (i.e., uses fewer samples per second), interpolator 
622 performs an interpolation operation to increase the number of samples as 
shown in step 902. When the time base of the input CELP format is higher than 
the model order of the output CELP format (i.e., uses more samples per 
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are well-known in the relevant arts. Ot 21 , T" 

CEU> format is the same as the time baseTme ^ 

base conversion is necessary. * format ' no 

coeffideTr^JT *-»»* filter 

coefficient as shownT L^o T ° Utpm f ~ 

Quantizer 6 „ receives the output forman, filter coefficients from £££ 
610C and ,uan tae s the output fonnant filter coefficients, as shown in s^l 2 

In the second stage of translation, the pitch and codebook Daram«*L 
(ako referred to as "excitation" parameters) I the input spTcn paTfZ 
translated from the input CEU> format to the output CELP to^lT 

eodeJt 8 6 ' SpeeCh 606 reives the pitch and 

codebook parameters of each input speech packet. Speech synthLzer ^06 

Smat^fflt SPeeCh l : gna, ' " , ™ d to " ^ W theTutP^ 

foment filter coefficients, which were generated by formant paraJL 

tonslator 620, and the input codebook and pitch excitation parameters as 
*T " S, f 10 f ™- - 1004, searcher 608 obtains the S£ 
c^ebook and pitch parameters using a search routine simitar to mat used by 
C1LP decoder 106, described above. Searcher 608 then quantizes the output 

parameters. F 



HG. 11 is a flowchart depicting the operation of searcher 608 according 
to a preferred embodiment of the present invention. In this search, searcher 608 

T^iT 31 * ^ C ° effidents S-^d by formant parameter 
transktor 620 and the target signal generated by speech synthesizer 606 and 
candidate codebook and pitch parameters to generate a candidate signal, as 
shown m step 1104. Searcher 608 compares the target signal and the candidate 
signal to generate an error signal, as shown in step 1106. Searcher 608 then 
vanes the candidate codebook and pitch parameters to nunimize the error 
signal, as shown in step 1108. The combination of pitch and codebook 
parameters that minimizes the error signal is selected as the output excitation 
parameters. These processes are described in greater detail below. 



BNSDOCID: < WO 0O48 1 70A 1 _l_> 



WO 00/48170 



PCT/USOO/03855 



s^thesize^andasea^OS. ReferringtoH ° 

mcludes a codebook 302A, a gain element 304A, a pit* fflteT^TL 

exatahort parameters and fnrm^f . 6 on 

decoder lOfi c !fi „ " " w ~ iKMSas > 35 described above for 

decoder 106. Specmcally, speech synthesizer 606 generates a target signal sJn) 
usmg the .put excitation parameters and the output formant fill coeTftcie^ 
^ codebook index I, is applied to codebook 302A to generate a codebook 

7T? v V6Ct0r b SCaled by ^ Clement 304A input 

codebook gam parameter G r Pitch filter 306A generates a pitch signal using the 

scaled codebook vector and input pitch gain and pitch lag parameters b, and L,. 
Formant filter 308A generates target signal s^n) using the pitch signal and the 
output formant filter coefficients ^ ... a£)n generated by formant parameter 
translator 620. Those of skill would appreciate that the time base of the input 
and output excitation parameters can be different, but the excitation signal 
produced is of the same time base (8000 excitation samples per second in 
accordance with one embodiment). Thus, time base interpolation of excitation 
parameters is inherent in the process. 

20 Searcher 608 includes a second speech synthesizer, a summer 1202, and a 

mirumization element 1216. The second speech synthesizer includes a 
codebook 302B, a gain element 304B, a pitch filter 306B, and a formant filter 
308B. The second speech synthesizer produces a speech signal based on 
excitation parameters and formant niter coefficients, as described above for 

25 decoder 106. 

Specifically, speech synthesizer 606 generates a candidate signal s c (n) 
using candidate excitation parameters and the output formant filter coefficients 
generated by formant parameter translator 620. Guess codebook index I,. is 
applied to codebook 302B to generate a codebook vector. The codebook vector 
30 is scaled by gain element 304B using input codebook gain parameter G G . Pitch 

filter 306B generates a pitch signal using the scaled codebook vector and input 
pitch gain and pitch lag parameters b c and Formant filter 308B generates 
guess signal s c (n) using the pitch signal and the output formant filter 
coefficients a D1 ... a^. 

35 Searcher 608 compares the candidate and target signals to generate an 

error signal r(n). In a preferred embodiment, target signal s^n) is applied to a 
sum input of a summer 1202, and guess signal s c (n) is applied to a difference 
input of summer 1202. The output of summer 1202 is the error signal r(n). 
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Error signal r(n) is provided tr» a ~, • • 
Mirumization element 1216 sIcT^ . »— «— *» element me. 

v_iiL»r coder 102. The codebook and pitch parameters that result 

STJ^rT T ,uantiied md used ^ 4,16 f ~ co^" 

£2T£f. ^ 11,6 fomUmt P aram «« of packet 

translator 600 to produce a packet of speech in the output CELP format. 
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Conclusion 14 

a in the art, and the generic prmciples defined herein mmv «. . . 

utner embodiments wifKm,+ +1 , ., . ; «e a Fpeu lo 

invention is J^^^Z^^TT:^ ^ *" 

t« 1,0 . , , ea to De ^ted to the embodiments shown herein but is 

.0 lisitr - scope * - -» - -1 
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CLAIMS 

What Is Claimed Is: 



4 



6 



6 



onecodeexcit^Z , " 8 * COm P ressed packet from 

one code excited linear prediction (CELP) format to another, comprising- 

a formant parameter translator that translates input formant filter 
coefficients having an input CELP format and corresponding to a s P 3 P a*e 
to an output CELP format to produce output form anffilter Ifncien^d 

an excitation parameter translator that translates input pitch and 
codebook parameters having an input CELP format and correspond^ s^d 
speech packet to said output CELP format to r, rn A F S S3ld 

codebook parameters. P ^ ° UtpUt P itch 

translaLcom^r^ * ^ ^ ^ *™ P™ 

formant ST" ^ COnVertS ** ™ del ° rder of said input 

formant Mter coefficients from a model order of said input CELP format to a 
model order of said output CELP format; and 

a tone base converter that converts the time base of said input formant 
filter coeffiaents from a time base of said input CELP format to a time base of 
said output CELP format. 

3. The apparatus of claim 2, where said excitation parameter 
translator comprises: 

a speech synthesizer that produces a target signal using said input pitch 
and codebook parameters and said output formant filter coefficients; and 

a searcher that searches for said output codebook and pitch parameters 
using said target signal and said output formant filter coefficients. 

4. The apparatus of claim 3, wherein said searcher comprises: 

a further speech synthesizer that generates a guess signal using guess 
excitation parameters and said output formant filter coefficients; 

a combiner that generates an error signal based on said guess signal and 
said target signal; and 

a irdrumization element that varies said guess excitation parameters to 
nunimize said error signal. 
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16 
18 



f u rt he r S co mpr ^: aPParato ° f 3 ' ***** ™* «— converter 

to pro d uce^Lr^^ fomatpriM,0 ^ b ^ id ^»er 



* further co mpr ^ : aPParahJS ° f ^ 5 ' Wte - *" d converter 

4 correct t'T'T f fateip0lateS Mid *" "efficients to produce order 
corrected coefficents when said model order of said input CELP format is 

lowermansaidmodelorderofsaidoutputCELP format; aTd 

a decimator that decimates said third coefficients to produce said order 
corrected coefficients when said model order of said input CELP formal 

• Wgherthansaidmodelorderofsaidou^utCELPformat 

2 mm 7 * ^ apparatUS of 3 - w "«<*> said speech synthesizer 

^ comprises: 

a codebook using said input codebook parameters to produce a 
4 codebook vector; ^ 

« ~* ! PitC *l ^ ^ Said pitCh filter P a «meters and said codebook 

6 vector to produce a pitch signal; and 

« * 3 £ T nimt ^ ^ ^ ° UtpUt f0rmant mter efficients and said 

8 P^ch signal to produce said target signal. 

"> 8. The apparatus of claim 7, wherein said guess excitation 

parameters include guess pitch filter parameters and guess codebook 
12 parameters, wherein said further speech synthesizer comprises: 

a further codebook using said guess codebook parameters to produce a 
14 further codebook vector; 

a pitch filter using said guess pitch filter parameters and said further 
codebook vector to produce a further pitch signal; and 

a formant filter using said output formant filter coefficients and said 
further pitch signal to produce said guess signal. 

9. The apparatus of claim 2, further comprising: 
a first formant filter coefficient translator that translates said input 
formant filter coefficients to a fourth CELP format before use by said time base 
converter. 
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10. 



The apparatus of claim 2, further comprising- 

said tiLT* f0mant ^ COeffident ** ***** the output of 

said time base converter from said fourth CELP form«f ^ -a 

format. format to said output CELP 



11. The apparatus of claim 5, wherein said third CELP format is a 
2 reflection coefficient CELP format. 

12. The apparatus of claim 9, wherein said fourth CELP format is a 
2 hne spectral pair CELP format. 

2 CELP format f ^v.^ *" * C ° m P ressed P^et from one 

2 CELP format to another, comprising the steps of: 

. (3) * anSl3ting m P ut formant ^ter coefficients corresponding to a speech 
formant filter coefficients; and F 

6 (b) translating input pitch and codebook parameters corresponding to said 
speech packet from said input CELP format to said output CELP format to 

8 produce output pitch and codebook parameters. 

14. The method of claim 13, wherein step (a) comprises the steps of. 

2 (i) converting the model order of said input formant filter coefficients from 
a model order of said input CELP format to a model order of said output CELP 
4 format; and * 

(ii) converting the time base of said input formant filter coefficients from a 
6 time base of said input CELP format to a time base of said output CELP format. 

15. The method of claim 14, wherein step (b) comprises the steps of: 

2 synthesizing speech using said input pitch and codebook parameters in 

said input CELP format and said output formant filter coefficients to produce a 
4 target signal; and 



searching for said output pitch and codebook parameters using said 
target signal and said output formant filter coefficients. 

16. The method of claim 14, wherein step (i) comprises the steps of: 
translating said input formant filter coefficients from said input CELP 
format to a third CELP format to produce third coefficients; and 
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6 



P-uceorTrco^^^^^ «* ' — — «P f ormat to 

rrLl f 6 m ^° d ° f d3im 16 ' WhGrein St6p C ° m P rises * e of: 
translating said order corrected mpf^ m . . r_.._£ r . . F 

fourth coefficients; " ~ " " ronnat t0 P roduc * 

said inoTc^ 6 ^ ° f Said ^ COeffidentS form a «™ base of 
said mput CELP format to a time base of said output CELP format to prod 
time base corrected coefficients; and rmartoprod 



uce 



8 to saidtZutCMp'f ^ COrrCCted COeffidentS *»" Said ^at 
to said output CELP format to produce said output formant filter coefficients. 

2 thesteps^of: ^ ***** " Ste P 

generating a guess signal using guess codebook and pitch parameters 
4 and said output coefficients; ^meters 

^ generating an error signal based on said guess signal and said target 
6 signal; and 5 

varying said guess codebook and pitch parameters to minimize said 
8 error signal. 

19. The method of claim 16, wherein step (i) further comprises the 
2 steps of: r 

interpolating said third coefficients to produce said order corrected 
4 coefficients when said model order of said input CELP format is lower than 

said model order of said output CELP format; and 
6 decimating said third coefficients to produce said order corrected 

coefficients when said model order of said input CELP format is higher than 
8 said model order of said output CELP format. 

20. The method of claim 16, wherein said third CELP format is a 
2 reflection coefficient CELP format. 

21. The method of claim 17, wherein said fourth CELP format is a line 
2 spectral pair CELP format. 
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TRANSLATE FORMANT FILTER COEFFICIENTS FROM 
INPUT CELP FORMAT TO OUTPUT CELP FORMAT 



502 



1 > 






TRANSLATE PITCH AND CODEBOOK PARAMETERS 
FROM INPUT CELP FORMAT TO OUTPUT CELP 


504 


> 






QufVNTi^e- output ?ARfc HfcTe*.^ wirrH 





FIG. 5 
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J^TntJl ™ MANT F,LTER COEFFICIENTS FROM 
INPUT CELP FORMAT TO REFLECTION COEFFICIENT 
CELP FORMAT 



702 



p > 


f 


CONVERT THE MODEL OR 
COEFFICIENTS FROM TH 
INPUT CELP FORMAT TO T 
OUTPUT CE 


DER OF THE REFLECTION 
E MODEL ORDER OF THE 
HE MODEL ORDER OF THE 
LP FORMAT 



TRANSLATE COEFFICIENTS TO A LINE SPECTRAL 
PAIR (LSP) CELP FORMAT 




CONVERT THE TIME BASE OF THE LSP COEFFICIENTS 
FROM THE INPUT CELP FORMAT TIME BASE TO THE 
OUTPUT CELP FORMAT TIME BASE 



708 



TRANSLATE THE COEFFICIENTS FROM LSP FORMAT 
TO THE OUTPUT CELP FORMAT TO PRODUCE 
OUTPUT FORMANT FILTER COEFFICIENTS 



710 



QuftV»TJ*E. OIHTUT foRMAfJ-r FILTER 

PpR^KAt 



-7a 



FIG. 7 
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INTERPOLATE THE COEFFICIENTS WHEN THE MODEL 

™ E mo4T H ^ E J NPUT CELP FOR MAT IS LOWER THAN 
THE MODEL ORDER OF THE OUTPUT CELP FORMAT 



DECIMATE THE COEFFICIENTS WHEN THE MODEL 
ORDER OF THE INPUT CELP FORMAT IS HIGHER THAN 
THE MODEL ORDER OF THE OUTPUT CELP FORMAT 
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FIG. 8 
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INTERPOLATE THE SAMPLES WHEN THE TIME BASE 
OF THE INPUT CELP FORMAT IS LOWER THAN THE 
MODEL ORDER OF THE OUTPUT CELP FORMAT 



DECIMATE THE SAMPLES WHEN THE TIME BASE OF 
THE INPUT CELP FORMAT IS HIGHER THAN THE 
MODEL ORDER OF THE OUTPUT CELP FORMAT 
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FIG. 9 
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SYNTHESIZE SPEECH USING THE INPUT PITCH AND 
CODEBOOK PARAMETERS TO PRODUCE A TARGET 

SIGNAL 



SEARCH FOR THE OUTPUT PITCH AND CODEBOOK 
PARAMETERS USING THE TARGET SIGNAL AND SAID 
OUTPUT FORMANT FILTER COEFFICIENTS 
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FIG. 10 
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GENERATE A TARGET SIGNAL USING INPUT 
CODEBOOK AND PITCH PARAMETERS AND OUTPUT 
COEFFICIENTS 



GENERATE A CAftMofli* SIGNAL USING CAfsfi>i*>ftT£ 
CODEBOOK AND PITCH PARAMETERS AND OUTPUT 
COEFFICIENTS 



GENERATE AN ERROR SIGNAL BASED ON CAWoiOA* 
SIGNAL AND TARGET SIGNAL 



1102 



1104 



1106 



VARY THE CAMwotocODEBOOK AND PITCH 
PARAMETERS TO MINIMIZE THE ERROR SIGNAL 



1108 



FIG. 11 
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CLEP TRANSCODING 

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to code-excited linear prediction (CELP) 
speech processing. Specifically, the present invention relates to translating 
digital speech packets from one CELP format to another CELP format. 

Related Art 

Transmission of voice by digital techniques has become widespread, 
particularly in long distance and digital radio telephone applications. This, in 
turn, has created interest in determining the least amount of information which 
can be sent over the channel while maintaining the perceived quality of the 
reconstructed speech. If speech is transmitted by simply sampling and 
digitizing, a data rate on the order of 64 kilobits per second (kbps) is required to 
achieve a speech quality of a conventional analog telephone. However, through 
the use of speech analysis, followed by the appropriate coding, transmission, 
and resynthesis at the receiver, a significant reduction in the data rate can be 
achieved. 

Devices which employ techniques to compress voiced speech by 
extracting parameters that relate to a model of human speech generation are 
typically called vocoders. Such devices are composed of an encoder, which 
analyzes the incoming speech to extract the relevant parameters, and a decoder, 
which resynthesizes the speech using the parameters which it receives over a 
channel, such as a transmission channel. The speech is divided into blocks of 
time, or analysis subframes, during which the parameters are calculated. The 
parameters are then updated for each new subframe. 

Linear-prediction-based time domain coders are by far the most popular 
type of speech coder in use today. These techniques extract the correlation from 
the input speech samples over a number of past samples and encode only the 
uncorrelated part of the signal. The basic linear predictive filter used in this 
technique predicts the current sample as a linear combination of the past 
samples. An example of a coding algorithm of this particular class is described 
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in the paper "A 4.8 kbps Code Excited Tin. r> „ 

5 in speech. Speech tvpLh, > natUra ' red ™dancies inherent 

m.eL g opeC rrer^lr.^^ *» — ^ to the 
•he vibration of the vocal ^"LT^Tl ™h redundancies due * 
by two filters a short ' 686 0 P era «°™ are modeled 

*L reaunTan^ ^o^I » „' -fT" ^ ^ 

as white gaussian noise , which is 1 e^f ^ "° d * d 

filters T S^2! °' "t " to COmpUte * e «* digital 

^TTdo^ rf 7 f ^ *«" fflters mu * * excite!, and 

Uus is done by determining which one of a number of random excitation 
waveforms in a codebook results i n *h. -I- . exatanon 
speech wh m «. T the closest approximation to the original 

^ee* when the waveform excites the two filters mentioned above. ThufTe 
transmitted parameters relate to three items (1) the LPC filter m », Jul, 
*» and (3) the codebook excitation. (1 ' the IJ>C filter, (2) the prtch filter 

decnH • WgW ***** "** ^ br0km ta «»» P«* encoding and 
decodmg, sometimes known as ana.ysis and synthesis. FIG. 1 is a b^ck 

Ctantl ™ 3 e °* r 102 - 3 ChMmeI 104 - *» d a decoder !06 

Channel 1104 can be a communications channel, storage medium, or the like 
Coder 102 receives digitized input speech, extra* the parameters descriMng 
defeatures of the speech, and quantizes these parameters into a so^b* 
*-« .that is sent to channel 104. Decoder 106 receives the bit stream from 
channel 104 and reconstructs the output speech waveform using the quantized 
3« features in the received bit stream. quantized 

Many different formats of CELP coding are in use today. In order to 
^ccessfuUy decode a CELP-coded speech signal, the decoder 106 must employ 
the same CELP coding model (also referred to as "format") as the encoder 102 
ftat produced the signal. When commumcations systems employing different 
CELP formats must share speech data, it is often desirable to convert the speech 
signal from one CELP coding format to another. 

One conventional approach to this conversion is known as "tandem 
«xl»8- FIG. 2 is a block diagram of a tandem coding system 200 for 
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encoded. CS^^^f",^* ^ " ° U * Ut ^ *>™ at 
.o hereinafter as ^pu^^ f T*" 3 ^ ^ 
format (^d ,„ ta l ^ " ^ -coded u^g one cbu- 

input sienal to * P formal) - Decoder 206 decodes the 

i-t ar^ras jar jsrr^" 

S»r^,srL" r 't; *— — - « 

perceptual d«J^ h ? 17 dlSadvanta S e of thk approach is the 

SUMMARY OF THE INVENTION 

CELPb^l/^ ^"f n * 3 meth ° d a PP aratUS for CELP-based to 
CELP-based vocoder packet translation. The apparatus includes a fcj 

parameter translator that translates input formatter coefficients for 

£1 £ " "T--™* f0nnat t0 ^ ° UtpUt CELP fonnat to P rod - output 
\T 7 ^ ^ ^ eXdtati ° n ParametCT ^tor that translates 
Zl^P , r 3meterS COrreS P° nd -6 to ^ speech packet from 

the mput CELP format to the output CELP format to produce output pitch and 
codebook parameters. The formant parameter translator includes a model 

c~m "TT y C ° nVertS m ° del ° rder ° f m P ut t filter 

coeffioents from the model order of the input CELP format to the model order 

of output CELP format and a time base converter that converts the time base 
of the mput formant filter coefficients from the time base of the input CELP 
format to the time base of the output CELP format. 

The method includes the steps of translating the formant filter 
coefficients of the input packet from the input CELP format to the output CELP 
format and translating the pitch and codebook parameters of the input speech 
packet from the input CELP format to the output CELP format. The step of 
translating the formant filter coefficients includes the steps of translating the 
fondant filter coefficients from input CELP format to a reflection coefficient 
CELP format, converting the model order of the reflection coefficients from the 
model order of the input CELP format to the model order of the output CELP 
format, translating the resulting coefficients to a line spectral pair (LSP) CELP 
format, converting the time base of the resulting coefficients from the input 
CELP format time base to the output CELP format time base, and translate the 
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resulting coefficients from LSP format to the output CELP ♦ a 

output formant filter coefficients Th* c . c *° P roduce 

codebook parameters J^T' t / P ° f tnmri « tfa « * e P*<* and 
Pitch and LdZT PS ° f l * nfl «k*B speech using the input 

output patch and codebook parameters using the target signal and Jt T^l 
formant filter coefficients. ^ ^ OUt P ut 

An advantage of the present invention is that it eliminate a 

« r h p.^, speech ^ nomiaUy ^ b ; 

BRIEF DESCRIPTION OF THE FIGURES 

become^moreT 8 ' "* °' *» will 

*keTl T !T ° m deSCTi P ti ° n Set forth Wow when 

taken m conjunction with the drawings in which like reference characters 
identify correspondingly throughout and wherein: 

and JSi^f ^ ° f 3 SyStCm ^ 

HQ 2 is a block diagram of a tandem coding system for converting from 
an input CELP format to an output CELP format; S 
FIG. 3 is a block diagram of a CELP decoder; 
FIG. 4 is a block diagram of a CELP coder; 

FIG. 5 is a flowchart depicting a method for CELP-based to CELP-based 
vocoder packet translation according to an embodiment of the present 
invention; F ni 

FIG. 6 depicts a CELP-based to CELP-based vocoder packet translator 
according to an embodiment of the present invention; 

FIGS. 7, 8, and 9 are flowcharts depicting the operation of a formant 
parameter translator according to an embodiment of the present invention; 

FIG. 10 is a flowchart depicting the operation of an excitation parameter 
translator according to an embodiment of the present invention; 

FIG. 11 is a flowchart depicting the operation of a searcher; and 

FIG. 12 depicts an excitation parameter translator in greater detail. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The preferred embodiment of the invention is discussed in detail below 
While specific steps, configurations and arrangements are discussed, it should 



BNSDOCID: <WO 00481 70A1JA> 



WO 00/48170 PCT/USOO/03855 



10 



15 



20 



25 



30 



35 



present invention. The present invention could find use in a vTiety of 
ctr^r Cati ° n ~ ***** satellite a^^lia 

^ A PrCferred is » CDMA --less 

spread spectrum communication systems for telephone service 

The present invention is described in two parts. First, a CELP codec 
mcludmg a CELP coder and a CELP decoder, is described. Then, a pack* 
translator is described according to a preferred embodiment. 

Before describing a preferred embodiment, an implementation of the 
exemplary CELP system of FIG. 1 is first described. In this implementation, 
CELP coder 102 employs an analysis-by-synthesis method to encode a speech 
signal. According to this method, some of the speech parameters are computed 
in an open-loop manner, while others are determined in a closed-loop mode by 
trial and error. Specifically, the LPC coefficients are determined by solving a set 
of equations. The LPC coefficients are then applied to the formant filter. Then 
hypothetical values of the remaining parameters (codebook index, codebook 
gam, pitch lag, and pitch gain) are used with the formant filter to synthesizes 
speech signal. The synthesized speech signal is then compared to the actual 
speech signal to determine which of the hypothetical values of the remaining 
parameters synthesizes the most accurate speech signal. 

A Code Excited Linear Predictive (CELP) Decoder 

The speech decoding procedure involves unpacking the data packets, 
unquantizing the received parameters, and reconstructing the speech signal 
from these parameters. The reconstruction consists of filtering the generated 
codebook vector using the speech parameters. 

FIG. 3 is a block diagram of a CELP decoder 106. CELP decoder 106 
includes a codebook 302, a codebook gain element 304, a pitch filter 306, a 
formant niter 308, and a postfilter 310. The general purpose of each block is 
summarized below. 

Formant filter 308, also referred to as an LPC synthesis filter, can be 
thought of as modeling the tongue, teeth and lips of the vocal tract, and has 
resonant frequencies near the resonant frequencies of the original speech caused 
by the vocal tract filtering, Formant filter 308 is a digital filter of the form 
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6 

1/A ( z ) = l-a 1 z 1 -...-a n 2- (1) 

The coefficients a, ... a „ of formant filter 308 are r . 

coefficients or LPC coefficients ^ *° 35 formant mter 

o^^^^i^ ° f T ~* «» «— * pu.se train 
by a complex ^^T™ SpeeCh » P roduced 

of air from the ^ E x ^!? ^ ** ^ " d °~ *« 
■ .,, „ _ . WgS - tXam P les of voiced sounds are the O in "low" and a 

constriln «tX^TtS^ T 11 ^ ^ ^ ^ * 
the IH in "these " |JL» k Examples of unvoiced sounds are 

uppers HtcK^a^in—l ^ ^ ^ * ^ 

VP(Z)=1/(1 .b2* L ) = l + bz%b J 2' a + ... 

where b is referred to the pitch gain of the niter and L is the pitch lag of the 
Codebook 302 can be thought of as modeling the turbulent noise in 
background no«e and silence, the codebook output is replaced bv r^T 

"tT. r C °t b °° k Vectore m «'«ted according to a codebook index I. The 
selected codebook vector is scaled by gain element 304 according to a codebook 
^ parameter G. Codebook 302 may include gain element 304 8 The ^£2 
the codebook is then also referred to as a codebook vector. Gain element 304 
can be unplemented, for example, as a multiplier 

Postfilter 310 is used to "shape- the quantization noise added by the 
™'? r — imperfections in the codebook. This noise cL be 

noticeable m frequency bands which have little signal energy, yet might be 

t — iarge signal eLgy. T f Hi 

advantage of tlus property, postfilter 310 attempts .0 put more quantization 
ncosemto perceptually insignificant frequency ranges, and 1 J noise into 
P^eptually s,gmficant frequency ranges. This postfiltering is discussed 
ta*-r „ J-H. Chen & A. Gersho. "Real-Time Vector APC SpLh Coding a, 
4800 bps wth Adaptive Postfiltering," in Proc ICASSP (1987) and N.S. Jayant & 
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V. Ramamoorfty, "Adaptive PostfiherLg of Speech," i„ Proc . ICASSP 829 _ 
(Tokyo, Japan, Apr. 1986). ^m>i>r 829-32 

In one embodiment, each frame of digitized speech contains one or more 

dir; 06 t or each subframe - a set ° f p»~ - ap P Hed : CEL ; 

decoder 106 to generate one subframe of synthesized speech .<„) The speech 

rZT tT codebook * dex 1 codebook ^ G ^ i ■ U^dTS 

b ^d formant niter coefficients a, ... a„. One vector of codebook 302 I seleLd 
according to index I, scaled according to gain G, and used to excite pitch filter 
306 and formant Mter 308. Pitch filter 306 operates on the selected codebook 
vector accordmg to pitch gain b and pitch lag L. Formant filter 308 operates on 
the signal generated by pitch filter 306 according to formant filter coefficients a 
. • • ^ to produce synthesized speech signal • (n). ' 

A Code Excited Linear Predictive (CELP) Coder 

The CELP speech encoding procedure involves determining the input 
parameters for the decoder which minimize the perceptual difference between a 
synthesized speech signal and the input digitized speech signal. The selection 
processes for each set of parameters are described in the following subsections 
The encoding procedure also includes quantizing the parameters and packing 
them mto data packets for transmission, as would be apparent to one skilled in 
the relevant arts. 

FIG. 4 is a block diagram of a CELP coder 102. CELP coder 102 includes 
a codebook 302, a codebook gain element 304, a pitch filter 306, a formant filter 
308, a perceptual weighting filter 410, an LPC generator 412, a summer 414, and 
a minimization element 416. CELP coder 102 receives a digital speech signal 
s(n) that is partitioned into a number of frames and subframes. For each 
subframe, CELP coder 102 generates a set of parameters that describe the 
speech signal in that subframe. These parameters are quantized and 
transmitted to a CELP decoder 106. CELP decoder 106 uses these parameters to 
synthesize the speech signal, as described above. 

Referring to FIG. 4, the generation of LPC coefficients is performed in an 
open-loop mode. From each subframe of input speech samples s(n) LPC 
generator 412 computes LPC coefficients by methods well-known in the 
relevant art. These LPC coefficients are fed to formant filter 308. 

The computation of the pitch parameters b and L and codebook 
parameters I and G however, is performed in a closed-loop mode, often referred 
to as an analysis-by-synthesis method. According to this method, various 
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hypothetical candidate values of codebook and ™^ 

a CELP coder to synthesize a speech sZ^ ^ ^ are ***** * 

•W for each guess is compared ^^^Z^^ 
The error signal r(n) that results from thTcomT ^ S(n) * SUm ™ r 414 " 
minimization element 416 xa comparison xs provided to 

of ^ z^jzzrz?^ 416 — d — 

combination *at mirumizes sLT^ " * e 

samples are provided to sum input of adder 414 PercJ! J 
utilized to weight the error at th* L„ • Z PtePce P tU111 ^eightmg is 
It is at a i , frequencies where there is less signal power 

_ Minimization element 416 conducts the search for the codebook and 
^parameters in two stages. First, minimization element 416 searches for'e 

SSST^ ^ * e piteh search *~ 15 no co ~- * 

codebook (G - 0). In mnumization element 416 all possible values for the pitch 
kg parameter L and the pitch gain parameter b are input to pitch fiher ^ 

£zr°\ element 416 *— *■ vaiues ° f l - d b ^ *r« 

r(n) between the weighted input speech and the synthesized speech 

^ *■ ,f itch la 5 L and P** b for the pitch filter are found, the 
codebook search is performed in a similar manner. Minimization element 416 
then generates values for codebook index I and codebook gain G. The output 
values from codebook 302, selected according to the codebook index I are 
multiplied m gain element 304 by the codebook gain G to produce the sequence 
of values used in pitch filter 306. Minimization element 416 chooses the 
codebook index I and the codebook gain G that nunimize the error r(n) 

In one embodiment, perceptual weighting is applied to both the input 
speech by perceptual weighting filter 410 and the synthesized speech by a 
weighting function incorporated within formant filter 308. In an alternative 
embodiment, perceptual weighting filter 410 may be placed after adder 414 
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CELP-based ,o CELP-based Vocoder Pacta Translation 

to as me t^ 8 f i r SSi0n ' *" ^ ,0 be ~« - 

codebocl »d P f t 8 ^ " tapUl " CEU> f °™ at *» -P«*- ""PU«- 

codebook and p,tch parameters and "input" fonnant m ^ coeffic . P 

LAew.se. the result of the translation is referred to as the "output pX 
havmg an output" CELP forma, that specifies "output" codebooi and P «ct 

such a translahon is to interface a wireless telephone system to me Le^fer 
exchanging speech signals. internet tor 

. k I 10 ' 5 I 3 fl ° WCharl depiCting * e method ^rding to a preferred 
embodunem. The translation proceeds in three stages. In tlfe firs, 

mpT^^rr of *■ tapm pa< ** - ^' * 

mpu, CELP fc t to me outpu , ^ ^ ^ 

^orjd s^ge, me pitch and codebook parameters of the input speech packet are 

^5<T 1 T' ^ 10 *• ° U *<" CELT forma,, as'hown in 

*ep 504_h me tturd stage, me outpu, parameters are quantized with the 
output CELP quantizer. 

en* J* 0 " .VT* 5 3 PaCk6t tTanSlat0r 600 acc ^ding to a preferred 
embodiment. Packet translator 600 includes a formant parameter translator 620 
and an excitation parameter translator 630. Formant parameter translator 620 
translates the input formant filter coefficients to the output CELP format to 
produce output formant filter coefficients. Formant parameter translator 620 
includes a model order converter 602, a time base converter 604, and formant 
filter coefficient translators 610A,B,C. Excitation parameter translator 630 
translates the input pitch and codebook parameters to the output CELP format 
to produce output pitch and codebook parameters. Excitation parameter 
translator 630 includes a speech synthesizer 606 and a searcher 608. FIGS 7 8 
and 9 are flowcharts depicting the operation of fonnant parameter translator 
620 according to a preferred embodiment. 

Input speech packets are received by translator 610A. Translator 610A 
translates the fonnant filter coefficients of each input speech packet from the 
input CELP format to a CELP format suitable for model order conversion. The 
model order of a CELP format describes the number of formant filter 
coefficients employed by the format. In a preferred embodiment-the input 
formant filter coefficients are translated to reflection coefficient format, as 
shown in step 702. The model order of the reflection coefficient format is 
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relevant art 6(1 T ! S * are well-known in the 

S^T 1 ^ 1 ^ CELP format reflerti - Sclent 

format formant filter coefficients, this translation is unnecessary 

Model order converter 602 receives the reflection coefficients from 

translator 610A and r n m,^ *u , maents tTOm 

*v - , "~ " " ,c " luuei or aer of trie reflection coefficients from 

the model order of the innnt rT?T v> t~ ^ ^ , 15:01,1 

CELP fa™* u P fannBt tC> ^ model order of output 

CELP format as shown in step 704. Model order converter 602 includes an 
mterpolator 612 and a decimator 614. When the model order of the m P ut Ce£ 
format , lower than the model order of the output CELP format, interpo W 
612 performs an mterpolation operation to provide additional coefficients, as 
shown m step 802. In one embodiment, additional coefficients are set to zero. 
When the model order of the input CELP format is higher than the model order 
of the output CELP format, decimator 614 performs a decimation operation to 
reduce the number of coefficients, as shown in step 804. In one embodiment, 
me unnecessary coefficients are simply replaced by zeroes. Such interpolation 
and decimation operations are well-known in the relevant arts. In the 
coefficient reflection domain model, order conversion is relatively simple 
making it a likely choice. Of course, if the model orders of the input and output 
CELP formats are the same, model order conversion is unnecessary. 

Translator 610B receives the order-corrected formant filter coefficients 
from model order converter 602 and translates the coefficients from the 
reflection coefficient format to a CELP format suitable for time base conversion 
The time base of a CELP format describes the rate at which the formant 
synthesis parameters are sampled, i.e., the number of vectors per second of 
formant synthesis parameters. In a preferred embodiment, the reflection 
coefficients are translated to line spectral pair (LSP) format, as shown in step 
706. Methods for performing such a translation are well-known in the relevant 
art. 

Time base converter 604 receives the LSP coefficients from translator 
610B and converts the time base of the LSP coefficients from the time base of the 
input CELP format to the time base of the output CELP format, as shown in 
step 708. Time base converter 604 includes an interpolator 622 and a decimator 
624. When the time base of the input CELP format is lower than the time base 
of the output CELP format (i.e., uses fewer samples per second), interpolator 
622 performs an interpolation operation to increase the number of samples, as 
shown in step 902. When the time base of the input CELP format is higher than 
the model order of the output CELP format (i.e., uses more samples per 
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second), decimator 624 performs a H^™*^ 

CELP for^.f • *. ° f COUrS6 ' 11 me ^ bas e of the input 

CELP format is the same as the time base of the output CELP format no tL 
5 base conversion is necessary. »nnai, no time 

coeff,V T Ti. at ° r • rCCeiVeS ^--corrected formant filter 

coefnaents from hme base converter 604 and translates the coefficients from the 

™1 ♦! T ° UtpUt CELP f0nnat to P roduce -tP-t formant filter 
coeffiaents as shown in step 710. Of course, if the output CELP format 
employs LSP format formant filter coefficients, this translation is unnecessary 
Quanuzer 611 receives the output formant filter coefficients from translator 
610C and quantizes the output formant filter coefficients, as shown in step 712 
In the second stage of translation, the pitch and codebook parameters 
V S ,,eXdtati0n " Parameters > of me *P« speech packet are 

s^w * ^ ^ ^ t0 *" ° UtpUt CELP for ™*' - ^own in 

step 504. FIG. 10 * a flowchart depicting the operation of excitation parameter 

translator 630 according to a preferred embodiment of the present invention 

Referring to FIG. 6, speech synthesizer 606 receives the pitch and 
codebook parameters of each input speech packet. Speech synthesizer 606 
generates a speech signal, referred to as the "target signal," using the output 
formant filter coefficients, which were generated by formant parameter 
translator 620, and the input codebook and pitch excitation parameters, as 
shown in step 1002. Then in step 1004, searcher 608 obtains the output 
codebook and pitch parameters using a search routine similar to that used by 
25 CELP decoder 106, described above. Searcher 608 then quantizes the output 

parameters. 

FIG. 11 is a flowchart depicting the operation of searcher 608 according 
to a preferred embodiment of the present invention. In this search, searcher 608 
uses the output formant filter coefficients generated by formant parameter 

30 translator 620 and the target signal generated by speech synthesizer 606 and 

candidate codebook and pitch parameters to generate a candidate signal, as 
shown in step 1104. Searcher 608 compares the target signal and the candidate 
signal to generate an error signal, as shown in step 1106. Searcher 608 then 
varies the candidate codebook and pitch parameters to miiumize the error 

35 signal, as shown in step 1108. The combination of pitch and codebook 

parameters that minimizes the error signal is selected as the output excitation 
parameters. These processes are described in greater detail below. 
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FIG. 12 depicts excitation parameter translator 630 in greater detail As 
T above, excitation parameter translator 630 indudes a speech 

mcludes a codebook 302A, a gain element 304A, a pitch filter 306A, and a 
formant filter 308A. Speech synthesizer 606 produces a speech signal based on 
exotataon parameters and formant filter coefficients, as described above for 
decoder 106. Specifically, speech synthesizer 606 generates a target signal sJn) 
usmg the mput excitation parameters and the output formant filter coefficients 
^put codebook index I, is applied to codebook 302A to generate a codebook 
vector. The codebook vector is scaled by gain element 304A using input 
codebook gain parameter G,. Pitch filter 306A generates a pitch signal using the 
**led codebook vector and input pitch gain and pitch lag parameters b, and L,. 
Formant filter 308A generates target signal s^n) using the pitch signal and the 
output formant filter coefficients ^ ... 3on generated by formant parameter 
translator 620. Those of skill would appreciate that the time base of the input 
and output excitation parameters can be different, but the excitation signal 
produced is of the same time base (8000 excitation samples per second, in 
accordance with one embodiment). Thus, time base interpolation of excitation 
parameters is inherent in the process. 

Searcher 608 includes a second speech synthesizer, a summer 1202, and a 
nununization element 1216. The second speech synthesizer includes a 
codebook 302B, a gain element 304B, a pitch filter 306B, and a formant filter 
308B. The second speech synthesizer produces a speech signal based on 
exatation parameters and formant filter coefficients, as described above for 
decoder 106. 

Specifically, speech synthesizer 606 generates a candidate signal s c (n) 
using candidate excitation parameters and the output formant filter coefficients 
generated by formant parameter translator 620. Guess codebook index I,, is 
applied to codebook 302B to generate a codebook vector. The codebook vector 
is scaled by gain element 304B using input codebook gain parameter G c . Pitch 
filter 306B generates a pitch signal using the scaled codebook vector and input 
pitch gain and pitch lag parameters b c and L c . Formant filter 308B generates 
guess signal s c (n) using the pitch signal and the output formant filter 
coefficients a OT ... a^. 

Searcher 608 compares the candidate and target signals to generate an 
error signal r(n). In a preferred embodiment, target signal s^n) is applied to a 
sum input of a summer 1202, and guess signal s c (n) is applied to a difference 
input of summer 1202. The output of summer 1202 is the error signal r(n). 
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Error stgnal r(„) is provided to a mrnimizatton element 1216 
Muunuzahon element 1216 selec* different combinations of coZok and 
ptch parameters and determines the combination that rrdnirnizes elr si^, 

ekmen, 416 of CE1P coder !02. The codebook and pitch parameters tha, res J! 
from th,s search are quantized and used with the formant fitter coefficient that 
are generated and quantized by the formant parameter translator of packet 
translator 600 to produce a packet of speech in the output CELP format 
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Conclusion 14 

The foregoing description of the preferred . - 

enable any person skilled intfw. a ^ ♦ P* eferred embodiments is provided to 

^ in tne art, and the generic principles defined herein m W „. „ . 

otner embodiments without the use of the inventive faculty "Z^Z*^ 
invention is not intended to be limited to ^ k J- ' prCSent 

t-r» Via _i - , uea to De ""^tea to the embodiments shown herein but ic 
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What Is Claimed Is: 
1. 



15 

CLAIMS 



™ a , apparatUS f ° r verting a compressed speech packet from 

one code exerted linear prediction (CELP) format to another, comprising- 

3 Parameter tranSlat ° r ** lwndate> m P Ut formant ^er 

coefficients havmg an input CELP format and corresponding to a speech packet 

to an output CELP format to produce output formant filter coefficients; and 

an exatauon parameter translator that translates input pitch and 
codebook parameters having an input CELP format and corresponding to said 
speech packet to said output CELP format to produce output pitch and 
codebook parameters. 

2. The apparatus of claim 1, wherein said formant parameter 
translator comprises: 

a model order converter that converts the model order of said input 
formant filter coefficients from a model order of said input CELP format to a 
model order of said output CELP format; and 

a time base converter that converts the time base of said input formant 
filter coefficients from a time base of said input CELP format to a time base of 
said output CELP format. 

3. The apparatus of claim 2, where said excitation parameter 
translator comprises: 

a speech synthesizer that produces a target signal using said input pitch 
and codebook parameters and said output formant filter coefficients; and 

a searcher that searches for said output codebook and pitch parameters 
using said target signal and said output formant filter coefficients. 

4. The apparatus of claim 3, wherein said searcher comprises: 

a further speech synthesizer that generates a guess signal using guess 
excitation parameters and said output formant filter coefficients; 

a combiner that generates an error signal based on said guess signal and 
said target signal; and 

a minimization element that varies said guess excitation parameters to 
nurrimize said error signal. 
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furthe/co^r™ ^ ^ 3 ' ^ ^ ^ converter 

«*c^^^^ *- — - -P- Want 
to produce third counts ^ * ~ * ^ 

furthel mp ^: aPParatUS * ^ * " ^ ™*» ^ inverter 

™ inte ^° lat0r folates said third coefficients to produce order 

corrected coefficients when said model order of said input C^P fo^at " 

lowerthansaidmodelorderofsaidoutputCELPformafaTd 

correctL^^ that d6dmateS Said coe ^ents to produce said order 
corrected coeffiaents when said model order of said input CELP format I 
higher than said model order of said output CELP format. 

comprises: ^ * ^ *' ^ ^ 

a codebook using said input codebook parameters to produce a 
4 codebook vector; F 

« ! ^ ffltCT ^ ^ ^ pitCh mter P^eters and said codebook 

6 vector to produce a pitch signal; and 

8 i» 3 ^ Said ° UtpUt formant mter coefficients and said 

b pitch signal to produce said target signal. 

"> 8. The apparatus of claim 7, wherein said guess excitation 

parameters include guess pitch filter parameters and guess codebook 
12 parameters, wherein said further speech synthesizer comprises: 

a further codebook using said guess codebook parameters to produce a 
14 further codebook vector; 

a pitch filter using said guess pitch filter parameters and said further 
16 codebook vector to produce a further pitch signal; and 

a formant filter using said output formant filter coefficients and said 
18 ™rther pitch signal to produce said guess signal. 

9. The apparatus of claim 2, further comprising: 
2 a first formant filter coefficient translator that translates said input 

formant filter coefficients to a fourth CELP format before use by said time base 
4 converter. 
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10. 



The apparatus of claim 2, further comprising- 

said time S b° nd ^ — es the output of 

4 folT ^ ^ fOUrth CELP *~* *> said output^CELP 

2 rpf,^ 11 " ^ aPPaiatUS ° f d3im 5 ' Wherein said C ELP format is a 

2 reflection coefficient CELP format. 

12. The apparatus of claim 9, wherein said fourth CELP format is a 
2 line spectral pair CELP format. 

, rxr p T' A meth ° d ^ COnVertm S a impressed speech packet from one 
z ^LP format to another, comprising the steps of: 

(a) translating input formant filter coefficients corresponding to a speech 
4 packet from an input CELP format to an output CELP format to produce output 

formant filter coefficients; and 
6 (b) translating input pitch and codebook parameters corresponding to said 

speech packet from said input CELP format to said output CELP format to 
8 produce output pitch and codebook parameters. 

14. The method of claim 13, wherein step (a) comprises the steps of: 

2 (i) converting the model order of said input formant filter coefficients from 
a model order of said input CELP format to a model order of said output CELP 
4 format; and 

(ii) converting the time base of said input formant filter coefficients from a 
6 time base of said input CELP format to a time base of said output CELP format. 

15. The method of claim 14, wherein step (b) comprises the steps of: 

2 synthesizing speech using said input pitch and codebook parameters in 

said input CELP format and said output formant filter coefficients to produce a 
4 target signal; and 

searching for said output pitch and codebook parameters using said 
6 target signal and said output formant filter coefficients. 

16. The method of claim 14, wherein step (i) comprises the steps of: 

2 translating said input formant filter coefficients from said input CELP 

format to a third CELP format to produce third coefficients; and 
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4 converting the model order of said third coefficients fmm ^ , , 

of said input CEIP Wtv,^ ^ , coerncients from a model order 

input L.fcLP format to a model order of said output CET P f™ * * 
6 Produce order corrected coefficients. a out P"t CELP format to 

2 ttLl me ^° d 1 ° f ^ 16 ' WhCrein StCp COm P rises *e ^eps of- 

2 translating said order corrected rnpffi,™^ . , P 

fourth coefficients; " ~ " wu " n ronnat to P r °duce 

4 said ZS?,* 8 ^ ^ ° f S3id fOUrth COe " ^rm a time base of 

8 tosaidtZut^f ^ ^ COrrCCted COeffiGientS ^ "* ^rmat 
said output CELP format to produce said output formant filter coefficients. 

2 the steps' of: ^ ^ * ^ ^ ^ uprises 

4 andsai^^^ 

6 signal;^"^ ^ ^ ^ ^ ^ ^ " nd "*> tar S et 

varying said guess codebook and pitch parameters to minimize said 
8 error signal. 

19. The method of claim 16, wherein step (i) further comprises the 
2 steps on 

interpolating said third coefficients to produce said order corrected 
4 coefficients when said model order of said input CELP format is lower than 

said model order of said output CELP format; and 
6 decimating said third coefficients to produce said order corrected 

coefficients when said model order of said input CELP format is higher than 
8 said model order of said output CELP format. 

20. The method of claim 16, wherein said third CELP format is a 
2 reflection coefficient CELP format. 

21. Thememodofdaiml^wheremsaidfourmC^LPformatisaline 
2 spectral pair CELP format. 
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^302 

TRANSLATE FORMANT FILTER COEFFICIENTS FROM 
INPUT CELP FORMAT TO OUTPUT CELP FORMAT 



TRANSLATE PITCH AND CODEBOOK PARAMETERS 
FROM INPUT CELP FORMAT TO OUTPUT CELP 



I r 506 

QUANTIZE OUTPUT PARAMETERS WITH 
OUTPUT CELP QUANTIZER 



FIG. 5 
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^702 

TRANSLATE FORMANT FILTER COEFFICIENTS FROM 
INPUT CELP FORMAT TO REFLECTION COEFFICIENT 

CELP FORMAT 



i_ ; r 704 

CONVERT THE MODEL ORDER OF THE REFLECTION 

COEFFICIENTS FROM THE MODEL ORDER OF THE 
INPUT CELP FORMAT TO THE MODEL ORDER OF THE 
OUTPUT CELP FORMAT 



1 


r 706 


TRANSLATE COEFFICIENTS TO A LINE SPECTRAL 
PAIR (LSP) CELP FORMAT 


< 


r 708 


CONVERT THE TIME BASE OF THE LSP COEFFICIENTS 
FROM THE INPUT CELP FORMAT TIME BASE TO THE 
OUTPUT CELP FORMAT TIME BASE 


< 


r 710 


TRANSLATE THE COEFFICIENTS FROM LSP FORMAT 
TO THE OUTPUT CELP FORMAT TO PRODUCE 
OUTPUT FORMANT FILTER COEFFICIENTS 






QUANTIZE OUTPUT FORMANT FILTER 
COEFFICIENTS USING OUTPUT CELP 
FORMAT 



FIG. 7 
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IMTCDPrvT A TC Tut? nnrrri/it^r^^ . . 

" ■ *■ v^x^/ v i >~ mi. ^ucrriL-itlN I 2> W MtJN i hit MODEL 
ORDER OF THE INPUT CELP FORMAT IS LOWER THAN 
THE MODEL ORDER OF THE OUTPUT CELP FORMAT 



. i r 804 

DECIMATE THE COEFFICIENTS WHEN THE MODEL 
ORDER OF THE INPUT CELP FORMAT IS HIGHER THAN 
THE MODEL ORDER OF THE OUTPUT CELP FORMAT 



FIG. 8 



902 



INTERPOLATE THE SAMPLES WHEN THE TIME BASE 
OF THE INPUT CELP FORMAT IS LOWER THAN THE 
MODEL ORDER OF THE OUTPUT CELP FORMAT 



904 



DECIMATE THE SAMPLES WHEN THE TIME BASE OF 
THE INPUT CELP FORMAT IS HIGHER THAN THE 
MODEL ORDER OF THE OUTPUT CELP FORMAT. 



FIG. 9 
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^1002 

SYNTHESIZE SPEECH USING THE INPUT PITCH 
AND CODEBOOK PARAMETERS TO PRODUCE 
A TARGET SIGNAL. 



^1004 

SEARCH FOR THE OUTPUT PITCH AND CODEBOOK 
PARAMETERS USING THE TARGET SIGNAL AND SAID 
OUTPUT FORMANT FILTER COEFFICIENTS. 



FIG. 10 
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C 

GENERATE A TARGET SIGNAL USING INPUT 
CODEBOOK AND PITCH PARAMETERS AND 
OUTPUT COEFFICIENTS 



1102 



c 

GENERATE A CANDIDATE SIGNAL USING 
CANDIDATE CODEBOOK AND PITCH 
PARAMETERS AND OUTPUT COEFFICIENTS 



1104 



i ^"06 

GENERATE AN ERROR SIGNAL BASED ON 
CANDIDATE SIGNAL AND TARGET SIGNAL 



VARY THE CANDIDATE CODEBOOK AND PITCH 
PARAMETERS TO MINIMIZE THE ERROR SIGNAL 



FIG. 1 1 
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